QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures

نویسنده

  • Tapabrata Ghosh
چکیده

We present QuickNet, a fast and accurate network architecture that is both faster and significantly more accurate than other “fast” deep architectures like SqueezeNet. Furthermore, it uses less parameters than previous networks, making it more memory efficient. We do this by making two major modifications to the reference “Darknet” model (Redmon et al, 2015): 1) The use of depthwise separable convolutions and 2) The use of parametric rectified linear units. We make the observation that parametric rectified linear units are computationally equivalent to leaky rectified linear units at test time and the observation that separable convolutions can be interpreted as a compressed Inception network (Chollet, 2016). Using these observations, we derive a network architecture, which we call QuickNet, that is both faster and more accurate than previous models. Our architecture provides at least four major advantages: (1) A smaller model size, which is more tenable on memory constrained systems; (2) A significantly faster network which is more tenable on computationally constrained systems; (3) A high accuracy of 95.7% on the CIFAR-10 Dataset which outperforms all but one result published so far, although we note that our works are orthogonal approaches and can be combined (4) Orthogonality to previous model compression approaches allowing for further speed gains to be realized.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clinical comparison of mechanical and chemomechanical methods in removing deep dentinal caries

Clinical comparison of mechanical and chemomechanical methods in removing deep dentinal caries Dr. F. Darabi* - Dr. N. Kia Rostami** *- Assistant Professor of Operative Dentistry Dept. - Faculty of Dentistry - Guilan University of Medical Sciences. ** - Dentist. Background and Aim: The use of CarisolvTM decreases unnecessary removal of sound dental tissue and reduces the possibility of pulpal e...

متن کامل

A Deep Learning Analytic Suite for Maximizing Twitter Impact

We present a series of deep learning models for predicting user engagement with twitter content, as measured by the number of retweets for a given tweet. We train models based on classic LSTM-RNN and CNN architectures, along with a more complex bi-directional LSTM-RNN with attention layer. We show that the attention RNN performs the best with 61% validation accuracy, but that all three deep lea...

متن کامل

Influences of Device Architectures on Characteristics of Organic Light-Emitting Devices Incorporating Ambipolar Blue-Emitting Ter(9,9-diarylfluorenes)

In this article, we report the studies of various device architectures of organic lightemitting devices (OLEDs) incorporating highly efficient blue-emitting and ambipolar carriertransport ter(9,9-diarylfluorene)s, and their influences on device characteristics. The device structures investigated include single-layer devices and multilayer heterostructure devices employing the terfluorene as one...

متن کامل

Reversible Architectures for Arbitrarily Deep Residual Neural Networks

Recently, deep residual networks have been successfully applied in many computer vision and natural language processing tasks, pushing the state-of-the-art performance with deeper and wider architectures. In this work, we interpret deep residual networks as ordinary differential equations (ODEs), which have long been studied in mathematics and physics with rich theoretical and empirical success...

متن کامل

Surface hardness improvement in high efficiency deep grinding process by optimization of operating parameters

The grinding is one of the most important methods that directly affects tolerances in dimensions, quality and finished surface of products. One of the major problems in the material removal processes specially grinding is the heat generation during the process and the residual tensile stress in the surfaces of product. Therefore, optimization of High Efficiency Deep Grinding (HEDG) process is t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1701.02291  شماره 

صفحات  -

تاریخ انتشار 2017